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Abstract 

The genetic diversity of 16 canine coronavirus (CCoV) samples is described. Samples were obtained from pups infected naturally 
living in different areas. Sequence data were obtained from the M gene and pol la and pol lb regions. The phylogenetic relationships 
among these sequences and sequences published previously were determined. The canine samples segregated in two separate clusters. 
Samples of the first cluster were intermingled with reference strains of CCoV genotype and therefore could be assigned to this 
genotype. The second cluster segregated separately from CCoV and feline coronavirus genotypes and therefore these samples may 
represent genetic outliers. The reliability of the classification results was confirmed by repeating the phylogenetic analysis with 
nucleotide and amino acid sequences from multiple genomic regions. 

© 2002 Elsevier Science B.V. All rights reserved. 
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1. Introduction 

Coronaviruses, a family of viruses belonging to the 
newly established order of the Nidovirales (Siddell, 1995; 
de Vries et ah, 1997; Enjuanes et al., 2000b), are 
pathogens of mammals and birds; based on genetic 
comparison, three distinct clusters can be distinguished. 
One of these clusters also shows antigenic cross-reactiv¬ 
ity and contains the canine coronavirus (CCoV), the 
swine transmissible gastroenteritis virus (TGEV), the 
porcine epidemic diarrhoea virus, the feline corona¬ 
viruses (FCoVs) and the human coronavirus 229E 
(HCoV 229E). Virus particles are enveloped and contain 
a non-segmented, positive stranded RNA genome 27-32 
kb in size (Siddell, 1995; Lai and Cavanagh, 1997). The 
genome contains two large open reading frames (ORFs), 
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la and lb, encoding two polyproteins leading to the 
viral replicase formation. Downstream to the ORFlb, 
there are 8-10 small ORFs, expressed through sub- 
genomic mRNAs and coding for the structural proteins 
S, E, M, the nucleocapsid (N) protein, and a number of 
presumptive non-structural proteins (Luytjes, 1995; 
Enjuanes et ah, 2000a) (Fig. 1). 

The trimeric spike (S) protein, the major inducer of 
virus-neutralizing antibodies (Gebauer et ah, 1991), 
forms characteristic viral peplomers which are involved 
in virus attachment on the cell receptors and in virus-cell 
fusion (de Haan et al., 2000). The small membrane 
protein (E), recognized recently as a structural compo¬ 
nent of the coronavirions, has been found to be 
important for viral envelope assembling (Raamsman et 
ah, 2000). The M protein, the most abundant structural 
component, is a type 111 glycoprotein consisting of a 
short amino-terminal ectodomain, a triple-spanning 
transmembrane domain, and a long carboxyl-terminal 
inner domain (Rottier, 1995). The M protein induces 
antibody-dependent complement-mediated virus neutra- 
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Fig. 1. Schematic representation of CCoV genome. Arrows indicate the position of the genome fragments analysed. 


lization although its function has not yet been fully 
clarified (Woods et al., 1987). 

One of the most interesting aspects of coronavirus 
replication is the occurrence of high-frequency homo¬ 
logous RNA recombination (Lai et al., 1985; Makino et 
al., 1986). This process is thought to be mediated by a 
‘copy-choice’ mechanism (Cooper et al., 1974; Kirke- 
gaard and Baltimore, 1986; Makino et al., 1986). 
Recombination of coronavirus genomes has been ob¬ 
served in tissue culture (Lai et al., 1985; Makino et al., 
1986; Sanchez et al., 1999; Kuo et al., 2000), in animals 
infected experimentally (Keck et al., 1988), and in 
embryonated eggs (Kottier et al., 1995). There is also 
evidence for homologous recombination occurring in 
mouse hepatitis virus in the field (Kusters et al., 1990; 
Wang et al., 1993; Jia et al., 1995) and recent findings 
suggest that it may also be an important factor in the 
evolution of FCoVs (Herrewegh et al., 1995, 1998; 
Vennema et al., 1995; Motokawa et al., 1996). Two 
FCoV serotypes (I and II) have been demonstrated that 
are distinguishable readily in vitro by virus neutraliza¬ 
tion assay (Hohdatsu et al., 1991, 1992). The FCoV type 
II strains have a CCoV-like S gene which is thought to 
provide a growth advantage or a mechanism for 
escaping from the immune response (de Groot et al., 
1987). Comparative sequence analysis of type I and type 
II FCoV and CCoV genomes suggests that the FCoV 
type II strains arose from a homologous RNA recom¬ 
bination between CCoV and a type I FCoV (Herrewegh 
et al., 1995; Vennema et al., 1995; Motokawa et al., 
1996). 

Accumulation of point mutations (as well as small 
insertions and deletions) in coding and non-coding 
sequences is the dominant force in the microevolution 
of (+) RNA viruses, resulting in proliferation of virus 
strains, serotypes and subtypes (Dolja and Carrington, 
1992). Extremely large ( + ) RNA virus genomes, such as 
those of coronaviruses, are predicted to accumulate 
several base substitutions per round of replication 
because of the high error frequencies of RNA polymer¬ 
ase (Jarvis and Kirkegaard, 1991). 

Comparative studies of CCoV strains have been 
hampered because only few strains proved to be suitable 
for growth in vitro. The development of PCR assays for 
CCoV (Bandai et al., 1999; Pratelli et al., 1999b; Naylor 
et al., 2001) has provided important information on the 
epidemiology of CCoV, which has been detected in as 


much as one-third of the pups affected with diarrhoea 
(Pratelli et al., 2000), often as co-infections with other 
canine pathogens such as parvovirus, rotavirus and 
adenovirus (Appel et al., 1979; Yasoshima et al., 1983; 
Martin and Zeidner, 1992; Pratelli et al., 1999a, 2001a). 

This paper describes a comparative analysis of the 
genes encoding for both the M protein and polymerase 
performed on CCoV isolates and faecal samples of pups 
that were positive for CCoV by PCR. 


2. Materials and methods 

2.1. Virus strains and clinical samples 

The reference CCoV strain S378 (supplied by L.E. 
Carmichael, Cornell University, Ithaca, NY), and the 
cell culture-adapted CCoV strains, 45/93 (Buonavoglia 
et al., 1994), A-2/99, 144/01, 257/98 and 65/00-30, were 
tested. Virus isolates were grown on the A-72 canine cell 
line. Eleven faecal samples (107/99, A-32/99, 79/99, 74/ 
99B, 74/99C, 103/99A, 83/99, 103/99E, C-27/3, 93/00, 
259/01) from 2 to 6 month-old pups affected with 
diarrhoea and negative to virus isolation but PCR 
positive to CCoV, were also tested. All the pups came 
from urban areas of different regions of southern Italy 
and they were either free-roaming dogs or family pets. 
Faecal samples were collected into sterile tubes, trans¬ 
ported to our laboratories and processed within 48 h. 

2.2. In vitro amplification by PCR 

Viral RNA was extracted from faecal samples and 
from infected A-72 cells using the RNeasy kit (Qiagen 
GmbH, Germany). Reverse transcriptase (RT)-PCR 
was carried out using the primer pair M5/M6 that 
amplified the M gene of CCoV. Briefly, reverse tran¬ 
scription was performed in a total reaction volume of 20 
pi containing PCR buffer 1 x (KC1 50 mM, Tris-HCl 
10 mM, pH 8.3), MgCl 2 5 mM, 1 mM of each 
deoxynucleotide (dATP, dCTP, dGTP, dTTP), RNase 
1 U, MuLV RT 2.5 U, random hexamers 2.5 U. 
Synthesis of cDNA was carried out at 42 °C for 30 
min, followed by a denaturation step at 99 °C for 5 min. 
The mixture was made up to a total volume of 100 pi, 
containing PCR buffer 1 x , MgCl 2 2 mM, Amplitaq 
Gold DNA polymerase 2.5 U and 50 pmol of each 
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primers M5 and M6. The PCR mixture was subjected to 
35 cycles (94 °C for 1 min, 55 °C for 1 min, 72 °C for 
1 min) in a DNA Thermal Cycler. 

In addition, RT-PCR assays were carried out on four 
fragments of the polymerase gene located in the ORFla 
region, fragments A (p493/p247) and B (p685/p670), and 
ORFlb region, fragments C (p650/p652) and D (p686/ 
p687), as reported by Herrewegh et al. (1998). C-DNA 
syntheses were undertaken at 42 °C for 30 min, 
followed by 5 min at 99 °C for all the fragments. The 
PCR reaction was carried out for 40 cycles (94 °C for 1 
min, 55 °C for 1 min, 72 °C for 1 min). The sequence 
and position of all the primers are shown in Table 1. 

2.3. Sequencing and phytogeny 

The PCR products were purified through Ultrafree- 
DA columns (Amicon, Millipore, Bedford, USA) and 
subjected to sequence analysis with the dideoxynucleo- 
tide chain-termination method, using ABI-PRISM 377 
(Perkin Elmer, Applied Biosystem Instruments). For 
each region, the nucleotide sequence of each sample was 
determined in both directions on at least two separate 
amplicons. Sequence data were aligned to reference 
strains by using with CLUSTALW program (version 
1.8) with the following settings: gap opening penalty of 
10.0 and gap extension penalty of 10.0 with transition 
weighted, and manually refined with GENEDOC (ver¬ 
sion 2.6.002; http://www.psc.edu/biomed/genedoc). The 
phylogenetic methods used were from phylip software 
package developed by J. Felsestein (version 3.5c; http:// 
evolution.genetics.washington.edu/phylip.html). Nu¬ 
cleotide and amino acid distances were calculated by 
Kimura’s two-parameter and p -distance methods, re¬ 
spectively. Phylogenetic trees were produced by using 
the NEIGHBOR-JOINING method on 1000 bootstrap 


replicates, followed by FITCH-MARGOLIASH to 
obtain proportional branch lengths (nucleotide and 
amino acid sequences), and by DNAML with a transi¬ 
tion/transversion ratio of 2.000 and randomizing the 
data input (nucleotide sequences), and PROTPARS on 
500 bootstrap replicates (amino acid sequences). Trees 
were drawn with treeview (version 1.6.6; http://tax- 
onomy.zoology.gla.ac.uk/rod/rod.html). 

2.4. Nucleotide sequence accession number 

The nucleotide sequence of the sample 259/01 will 
appear in the DDBJ/EMBL/GenBank databases under 
accession no. AF502583. The references for and the 
nucleotide sequence accession numbers of the M and 
polymerase genes of the coronavirus strains mentioned 
in this study are reported in Table 2. 


3. Results 

A total 17 samples were analyzed: the reference CCoV 
strain S378, five cell culture-adapted CCoV strains and 
eleven faecal samples of pups. PCR amplicons of the 
expected size were obtained with the primer pairs M5/ 

Table 2 

Accession numbers of the M and pol genes of the strains mentioned in 
this study 


Strain Accession Reference 


Insavc 

D13096 

Horsburgh et al. (1992) 


FCoV 79-1146 

AF326575 

de Groot et al. (1987) 


FCoV 79-1683 

Y13921 

Herrewegh et al. (1998) 


FCoV TN406 

X90570 

Herrewegh et al. (1998) 


FCoV UCD-1 

X90575 

Herrewegh et al. (1998) 


TGEV-Purdue 

NC_002306 

Almazan et al. (2000) 



Table 1 

Primers used for PCR amplification and sequence analysis 


Primer 

Sequence 5'-3' 

Sense 

Position 

Amplificon size (bp) 

M5 

CCTT GTTT G A ACT A AAC AA AAT G AAG 

+ 

6382-6407“ 

835 

M6 

TCCCTGAGAGGCCATTTAGA 

- 

7197-7217“ 


p493 b 

GTAGTAGCAT C ACTCT GC ACTTC 

+ 

1575-1597“ 

320 

p247 b 

CCGTTGT GGTGTCACATTAAC 

- 

1895-1875“ 


p685 b 

GCCCTT G AGT AT A ACTT GT GAG 

+ 

11 531-11 552“ 

389 

p670 b 

TCAGTGGTTTACCAGCTTGCA 

- 

11920-11899“ 


p650 b 

TGGT GCTGTT GCT GAGC AT G A 

+ 

12 574-12 594“ 

749 

p652 b 

ACAACTACTGGTACACCAT CAA 

- 

13 323-13 302“ 


p686 b 

GAAACTGTGAAAGCTAAGGAGG 

+ 

15 500-15 521“ 

472 

p687 b 

ACACAATGAGATTTACCGCTACC 

- 

15 972-15 950“ 



“ Primers position is referred to the sequence of stain Insavc (accession: D13096). 
b Herrewegh et al., 1998. 

0 Primers locations are indicated as originally reported in the corresponding papers. 
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M6, p493-p247, p685/p670, p650/p652, p686/p687. 
Some pol regions of several faecal samples were not 
amplified, probably due to viral loads close to the 
sensitivity threshold of the RT-PCR assays. The ob¬ 
servation that in vitro-grown isolates were amplifiable 
with all primer sets used (excepted 65/00-30 isolate that 
scored negative with pol la B primers) strengthened this 
hypothesis (Table 3 for the details). 

Multiple alignments showed that the studied se¬ 
quences differed from each other, thus ruling out 
possible contaminations. Multiple nucleotide alignments 
deduced from pol la and pol lb regions were straightfor¬ 
ward, whereas those obtained from the M gene exhib¬ 
ited numerous insertions and deletions and needed to be 
refined by manual editing. Alignment of deduced amino 
acid sequences showed that the great majority of 
nonsynonymous substitutions were localized within the 
NH 2 -end ectodomain of the M protein. Although no 
consistent pattern was observed, all the insertions and 
deletions were in-frame and located in four defined 
positions. Moreover, initial direct inspection revealed 
that seven Italian samples analysed contained amino 
acid residues which were typical of FCoV isolates: lie -> 
Val-128, Ile/Val->Ala-132, Val^Thr-178, Ile^Met- 
197, Asp->Glu-205, Asn -+ His-206, Lys ^Gln-228, 
Asp -» Glu-251 and Gin —> His-260; thus highlighting 
the fact that these specimens contained coronavirus 
RNA with more deduced amino acid similarity to FCoV 
than to CCoV in this region (Fig. 2). 

Phylogenetic relationships were inferred from the 
genetic distances by constructing phylogenetic trees 
using the distance-based methods NEIGHBOR-JOIN¬ 
ING and FITCH-MARGOL1ASH, and the character- 


state method DNAML. For this analysis, distance 
values for the M gene were recalculated by trimming 
the ectodomain region (i.e. by reducing the sequence 
stretch analyzed from 835 to 681 bp) which did not 
allow precise phylogenetic inference because of its high 
heterogeneity (homoplasy). Trees were rooted by using 
the porcine TGEV isolate Purdue as an outgroup. To 
appraise the sensitivity of NEIGHBOR-JOINING to 
the input order of species, the program was run, in a 
pilot study, by using either fixed or multiple randomized 
orders of M sequences and by repeating the process ten 
times. The resulting trees differed slightly but phy- 
logroups and outliers were identical (trees not shown). 
Since the randomizing option was time-consuming and 
did not influence segregation, NEIGHBOR-JOINING 
trees were produced from a fixed order of bootstrapped 
sequences only and by using the multiple data set option 
(100 reiterations). Branch lengths were then calculated 
from the consensus trees using FITCH-MARGO- 
LIASH. Topologies of trees from all of the genomic 
regions analyzed were very similar, the major phyloge¬ 
netic lineages defining the FCoV and CCoV genotypes 
were delineated clearly and generally supported by high 
bootstrap values. Irrespective of the genome portion 
analyzed, the CCoV strains Insavc, S378, 45/93, 257/98, 
A-2/99, 144/01 and 65/00-30, and FCoV TN406 and 
UCD1, clustered into different phylogroups. Both the 
FCoV type II isolates segregated together with the 
CCoV isolates when analyzed for pol lb (region D); 
when analyzed for pol lb (region C) only the FCoV 79- 
1683 strain segregated with CCoV. 

The Italian samples analyzed segregated in two 
different clusters, indicating a distinct genetic related- 


Table 3 

List of the viruses analysed in this study: the genome fragments successfully amplified and sequenced are indicated 


Samples 

M 

pol la (A) 

pol la (B) 

pol lb (C) 

po !lb (D) 

S/378 

+ 

+ 

+ 

+ 

+ 

257/98 

+ 

+ 

+ 

+ 

+ 

45/93 

+ 

+ 

+ 

+ 

+ 

83/99 

+ 

- 

- 

- 

- 

93/00 

+ 

+ 

+ 

+ 

+ 

A-2/99 

+ 

+ 

+ 

+ 

+ 

144/01 

+ 

+ 

+ 

+ 

+ 

65/00-30 

+ 

+ 

- 

+ 

+ 

103/99A 

+ 

+ 

+ 

+ 

+ 

103/99E 

+ 

+ 

+ 

+ 

+ 

C-27/3* 

+ 

+ 

- 

+ 

+ 

107/99* 

+ 

+ 

- 

+ 

- 

74/99C* 

+ 

+ 

- 

+ 

+ 

74/99B* 

+ 

+ 

- 

- 

+ 

79/99* 

+ 

- 

- 

- 

+ 

A-32/99* 

+ 

+ 

+ 

- 

+ 

259/01* 

+ 

+ 

+ 

+ 

+ 


+ , amplified; —, not amplified. 
* FCoV-like CCoVs. 
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Insavc 

MKKILFLLAC 

AIACVYGE— 

S-378-fasta 


- 

— 

-c..— 

257/98 

-M...LI.. 


.V. . 

....QI 

45/93 

-M...LI.. 


.V. . 

....QI 

83/99 

-M...LI.. 


.V. . 

....QI 

93/00 

-M. 


...FAC..— 

A-2/99 

-M...LI.. 


• V. . 

....QI 

144/01 



...FAC..-- 

65/00-30 

-M...LI.. 


.V. . 

....QI 

103/99A 

-M...LI.. 


• V.. 

....QI 

103/99E 

.LI.. 


• V.. 

. . . .- 

C-27/3* 

-M...LIP. 



. . . .- 

107/99* 

.L. I. 



_ 

74/99C* 

-M...L.I. 



. . . .- 

74/99B* 

-M...L.I. 



. . . .- 

79/99* 

.L.I. 



_ 

A-32/99* 

.L.T. 




259/01* 

-. T,. I. 




79-1683type2 

-M...LI.. 


• V. . 

....QI 

79-1146type2 

..Y..LI.. 


I... 

. . . .- 

TN406 

•NYV.LI.. 


I... 

• F. .QI 

UCD1 

..YL..IF. 


I... 

....QI 


RYCAM 

TE—S 

STSCRNSTAG -NCAS 

3FETG 

DLIWHLANWN FSWSVILIIF 


KSd-. 

E. . . 

.T -D.E. 

. .NG. 

.I...v. 


3 .—T 

GL. . 

.,G..S -D.E. 

. .NG. 

.I..,v. 


3.—T 

GL. . 

..G..S -D.E. 

. .NG. 

.I..,v. 


3.—T 

GL. . 

..G..S -D.E. 

. .NG. 

.I..,v. 


K. — . 

D. . . 

....TT -D.E. 

. .NG. 

.V. 


3.—T 

GL. . 

.,G..S -D.E. 

. .NG. 

.I...v. 


KSd-. 

D. . . 

....TT -D.E. 

. .NG. 

.V. 


3. —T 

GL. . 

.,G..S -D.E. 

. .NG. 

.I..,v. 


3. —T 

GL. . 

..G..S -D.E. 

. .NG. 

.I..,v. 


3.—T 

GL. . 

..G..S -D.E. 

. .NG. 

.I_v. 


. DGVD 


. .N.SS SA. .. 

. INS. 

.V. 


3Td-N 

PM. . 

. .N.DS -K... 

. . .O. 

.V. 


. DEST 

.ML. 

...,s. -S. .. 

. .NG . 


. DEST 

.ML. 

_.s. -S _ 

. . NG . 


KNd-E 

PM. . 

.S -E . . . 

_O. 


. DGVD 


..N.SS SA. .. 

. INS. 

.V. 


KNd-E 

PM. . 

. S -E. .. 

. . .Q. 



3. — T 

GL. . 

. ,G.,S -D.E. 

. .NG. 

.I..,v. 


3d —. 

GLQ. 

I.G.NS -R.QT 

. . .R. 

.V. 


KDd- . 

.NT. 

I.G.NS -S.QT 

. . .R. 

. V. 


KSd- . 

G.Q. 

VYG.NS -T.QT 

. . .R. 

.V. 




— 


[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
t 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
[ 70] 
t 70] 
[ 70] 
[ 70] 
[ 70] 


Insavc ITVLQYGRPQ FSWFVCGIKM LIMWLLWPIV LALTIFNAYL EYRVSRYVMF GFSVAGATVT FILWIMYFVR 

S-378-fasta .Y.S ..E. PI .I. 

257/98 .YS.T_S ..E.I...II. .V. 

45/93 .Y.T_S ..E.I...II. .V. 

83/99 .YS.T_S ..E.II. .V. 

93/00 .Y.S ..E.I. 

A-2/99 .YS.T_S ..E.I...II. .V. 

144/01 .Y.S ..E.I. 

65/00-30 .YS.T_S ..E.I...II. .V. 

103/99A .Y.T_S ..E.I. 

103/99E .Y.T_S ..E.I. 

C-27/3* Y.S ..E.I...V.. .A. 

107/99* Y.S ..E.I...VI. .A. 

74/99C* Y.S ..E.I...VI. .A. 

74/99B* Y.S ..E.I...VI. .A. 

79/99* Y.S ..K.I...VI. .A..L. 

A-32/99* Y.S ..E.I...VI. .A. 

259/01* Y.S ..K.I...VI. .A. 

79-1683type2 .Y.S . .E.V.. .A. .M. 

79-1146type2 .L.Y.S ..Q.V. . .A..M. 

TN406 L.Y.S ..Q_L.I...F.. .A. .M. 

UCD1 L.Y.S ..Q.VI. .A..M. 


[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 

[140] 


Insavc 

SIQLYRRTKS 

WWSFNPETSA 

ILCVSALGRS 

YVLPLEGVPT 

GVTLTLLSGN LCAEGFKIAG 

GMNIDNLPKY 



.N. 



.Y. . 



257/98 

45/93 

83/99 

93/00 

A-2/99 

144/01 

65/00-30 

103/99A 

103/99E 

C-27/3* 

107/99* 

74/99C* 

74/99B* 

79/99* 

A-32/99* 

259/01* 

79-1683type2 


.N. 



.Y. . 







.Y. . 




.N. 



.Y. . 




.N. 



.Y. . 




.N. 



.Y. . 




.N. 



.Y. . 




.N. 



.Y. . 




.N. 



.Y. . 




.N. 



.Y. . 




.N. 


.T. . 

.Y. . 

. .M. . 



.N. 


.T. . 

.Y. . 

. .M. . 

....EH.... 


.N. 


.T. . 

.Y. . 

. .M. . 



.N. 


.T. . 

.Y. . 

. .M. . 



.N. 


.T. . 

.Y. . 

. .M. . 

....EH.... 


.N. 


.T. . 

.Y. . 

. .M. . 

.... EH.... 


.N. 


.T. . 

.Y. . 

. .M. . 



.N. 

_N. 

.D.T.. 

.Y. . 

. .M. . 

.LT.EH.... 

79-1146type2 

.V. 

.N. 

_N. 

.D.T.. 

.Y. . 

. .M. . 

.LT.EH.... 

TN406 


.N. 

_N. 

.D.T.. 

.Y. . 

. .M. . 

• LT * EH•••• 

UCD1 


.N. 

_N_R 

.D.T.. 

.Y. . 

. .M. . 

•LT.EH•••• 


[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 

[ 210 ] 


Fig. 2. Amino acid sequence of the M gene of coronaviruses from dog and cat. Asterisks indicate FCoV-like CCoVs. The NH 3 -terminus ectodomain 
is boxed. 
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Insavc VMVALPVRTI VYTLVGKKLK ASSATGWAYY VKSKAGDYST DARTDNLSEH EKLLHMV* 


S-378-fasta .S.Q 

257/98 S.Q.Q 

45/93 S.Q.Q 

83/99 S.Q 

93/00 S.Q 

A-2/99 S.Q 

144/01 S.Q 

65/00-30 S.Q.Q 

103/99A S.Q 

103/99E S.Q 

C-27/3* S.Q.E. 

107/99* S.Q.E. 

74/99C* S.Q.E. 

74/99B* S.Q.E. 

79/99* S.Q.E. 

A-32/99* .SS.Q.T. E. 

259/01* S.Q.E. 

79-1683type2 ..I.T.N.Q. . .TT.E. 

79-1146type2 . .I.T.S.Q. . .TT.E. 

TN406 ..I.T.S.Q.. .TT.E. 

UCD1 ..I.T.S.Q.. .TT.E..P. 


[ 268 ] 

[ 268 ] 

[ 268 ] 

[ 268 ] 
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Fig. 2 ( Continued ) 


ness. This result was clearly evident in the trees obtained 
from the M gene where samples 83/99, 93/00 103/99A 
and 103/99E clustered with prototype canine isolates 
thus behaving as typical CCoV strains, whereas samples 
A32/99, C27/3, 74/99B, 74/99C, 79/99, 107/99 and 259/01 
segregated together giving rise to an outlier cluster 
localized between the feline and canine genotypes. 
Segregations were supported by high bootstrap values 
and statistical significance thus emphasizing the robust¬ 
ness of tree topology (Fig. 3a). Although the number of 
samples analyzed was smaller and the bootstrap values 
were lower than for the M gene, the segregation pattern 
of Italian samples was observed consistently also when 
analyzed in pol 1 b-rcgion D (Fig. 3e). The topology of 
the trees built with the other genome stretches shown 
confirmed the presence of typical CCoV strains and 
CCoVs that could not be assigned to either CCoV or 
FCoV genotypes, although these samples did not fall 
within a single branch but were dispersed into smaller 
clusters and located between the outgroup strain and 
samples belonging to FCoV genotype (Fig. 3b-d). 
Similar findings were also observed by analyzing the 
amino acid sequences (data not shown). Altogether, 
these results suggested that outlier samples may have 
undergone partly separate evolutions from typical 
CCoV strains. 


4. Discussion 

The main purpose of this study was to investigate the 
extent of genetic diversity among the CCoVs circulating 
in Italy and to examine their relationships to strains 
characterized previously. Although our findings are 


based on a number of samples that is too small to yield 
firm conclusions, they provide however evidence that 
the circulating CCoVs in Italy segregate into two 
separate groups. Interestingly, in the M and pol lb- 
region D trees the outlier samples were related closely to 
each other and formed a single phylogroup. Pairwise 
genetic distances as estimated from multiple nucleotide 
alignments of these sequences together with those of 
other reference isolates readily allowed to distinguish the 
different genotypes, indicating that the fragments ana¬ 
lyzed contained sufficient information to characterize 
the coronavirus-positive samples. The overall nucleotide 
divergence, shown in Table 4, showed that Italian 
samples were highly heterogeneous with distance values 
that were not dissimilar from those found between 
CCoV and FCoV isolates. The matrix of pairwise 
genetic distances showed that most Italian samples 
were closely related to the CCoV genotype but the 
C27/3, 74/99B, 74/99C, A32/99, 79/99, 259/01 and 107/99 
samples clearly differed from the others, as they 
exhibited high distance values from the other Italian 
isolates and from the reference genotype CCoV isolates 
(data not shown) and have been therefore tentatively 
called FCoV-like CCoVs. A possible explanation for 
this phenomenon is that under natural conditions 
homologous recombination events between highly 
homologous coronaviruses, such as CCoV and FCoV, 
occur very frequently. This means that there is a 
frequent interspecific circulation of either CCoV to 
cats or FCoV to dogs since mixed infections are required 
to give rise to recombination. Where the recombination 
takes place is unknown, but under experimental condi¬ 
tions cats can be infected with CCoV (Barlough et al., 
1984; Stoddart et al., 1988; McArdle et al., 1990, 1992) 
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Fig. 3. Evolutionary relationships among coronavirus strains based on nucleotide sequences of M gene (a), po/la-region A (b), po/la-region B (c), po/lb-region C (d) and po/lb-region D (e). 
Branching pattern of the trees was obtained by NEIGHBOR-JOINING. Bootstrap values above 60 out of 100 are shown at the branch points. Double asterisks indicate branches at P <0.01 as 
calculated by DNAML. Bar indicates the number of nucleotide substitutions/site, calculated by the Kimura’s two-parameter model with a transition/transversion of 2.00. Numbers indicate samples 
amplified and sequenced from faecal samples. Empty and solid circles indicate, respectively, FCoV isolates listed in Table 2 and CCoV isolates amplified front cell cultures and the Insavc strain 
retrieved from GenBank. In the sake of simplicity, the recombinant FCoV type II strains 79-1146 and 79-1163 are also indicated by their initials. 
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Table 4 

Nucleotide variations among CCoVs and FCoVs 


Typical CCoVs 



FCoV-like CCoVs 



FCoVs 






M 

A 

B 

C 

D 

M 

A 

B 

C 

D 

M 

A 

B 

C 

D 


4.98 

5.75 

5.49 

2.75 

2.00 

11.74 

9.31 

7.27 

5.15 

7.86 

17.51 

19.74 

16.21 

12.62 

13.64 

Typical CCoVs 






7.02 

8.82 

10.19 

5.54 

3.28 

15.93 

19.25 

16.48 

12.08 

11.41 

FCoV-like CCoVs 











5.06 

6.68 

4.41 

3.50 

4.00 

FCoVs 


Values indicate the arithmetic average x extrapolated from the nucleotide matrix of comparison and are expressed in %. M, M gene. A, 320 bp 
fragment of the ORFla. B, 389 bp fragment of the ORFla. C, 749 bp fragment of the ORFlb. D, 472 bp fragment of the ORFlb. Due to 
recombination events affecting the genome of FCoVs type II, the values referred to strains 79-1683 and 79-1146 were not considered in C and D. 


and, although it has not been demonstrated that FCoV 
may infect dogs cannot be ruled out. According to this 
hypothesis, restricted sites of recombination (i.e. regions 
of very high nucleotide identity between two highly 
homologous coronaviruses) may exist in the genome of 
coronaviruses and these sites may readily undergo 
template switches. This explanation assumes that cor¬ 
onaviruses of domestic carnivores possess a sort of 
‘dynamic’ genome. However, the presence of numerous 
synapomorphic (shared-derived) signals allowing seg¬ 
regation of outlier FCoV-like CCoVs samples in a single 
phylogroup, in the M gene and pol lb-region D is 
compatible with the idea that these samples diverged 
at approximately the same time from an ancient 
common ancestor and subsequently underwent indepen¬ 
dent evolutions. This hypothesis is also consistent with 
the appreciable variations detected throughout the 
fragments of the pol la and pol lb of FCoV-like CCoVs 
as compared to typical CCoVs. Considerable nucleotide 
variations, affecting the reliability of a CCoV diagnostic 
test (nested-PCR), have been already described in a 
fragment of the M gene of some CCoV strains (Pratelli 
et al., 2001b). 

Interestingly, sequence analysis on the whole M 
protein of CCoVs revealed a high amino acid variability 
in the amino-terminal end which is exposed on the 
outside of the virion according to the topological model 
proposed by Risco et al. (1995); whereas in the sequence 
of the transmembrane and of the carboxyl-terminal 
domains there were only a few scattered substitutions 
(Fig. 2) and most of the substitutions occurring in the M 
protein of FCoV-like CCoVs were similar to residues 
typical of FCoV. Although the major immunological 
role has been attributed to the S protein, both the 
amino- and the carboxy-termini of the M protein of 
coronavirus elicit a strong immune response (Enjuanes 
et al., 2000a). Therefore, the variations observed in the 
M protein might confer new functions and/or properties 
to the virus resulting in survival advantages, such as 
escape from the immune system. 

Comparative analysis on other genomic fragments of 
CCoVs will help to elucidate the evolution and relation¬ 


ships between coronaviruses in domestic carnivores, 
whereas studies on the antigenic correlations between 
‘typical’ and ‘atypical’ CCoVs will provide insight into 
the biological role and pathogenesis of FCoV-like 
CCoVs. Large molecular surveys will be necessary to 
determine the real prevalence and divergence from 
classical genotypes of the outlier samples. These data 
will be helpful to discern whether FCoV-like CCoVs 
belong to a new, unrecognized, coronavirus genotype. 
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